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Abstract 

Enzyme costs play a major role in the choice of metabolic routes, both in evolution and bioengi¬ 
neering. Given desired fluxes, necessary enzyme levels can be estimated based on known rate laws and 
on a principle of minimal enzyme cost. With logarithmic metabolite levels as free variables, enzyme 
cost functions and constraints in optimality and sampling problems can be handled easily. The set 
of feasible metabolite profiles forms a polytope in log-concentration space, whose points represent all 
possible steady states of a kinetic model. We show that enzyme cost is a convex function on this 
polytope. This makes enzyme cost minimization - finding optimal enzyme profiles and corresponding 
metabolite profiles that realize a desired flux at a minimal cost - a convex optimization problem. 


1 Introduction 

The metabolic fluxes in cells are driven by enzymes, which come at a cost. Translating a given flux 
profile into the necessary enzyme profile, and computing the corresponding enzyme cost, is not a trivial 
task. In kinetic models, a reaction rates v = E ■ r(c) is a product of enzyme level E and an enzyme- 
specific rate given by the rate law r(c). If metabolite levels were known, the enzyme demand could be 
directly computed; the specific enzyme demand E/v = l/r(c) is simply obtained by inverting the rate 
law. However, since metabolite levels are not fixed, the fluxes in a network can be realized by many 
possible enzyme profiles, each with a corresponding metabolite profile. To select a plausible solution, we 
employ an optimality principle: we define an enzyme cost function (for instance, total enzyme mass) and 
choose among all possible enzyme profiles the one with the lowest cost. As a constraint, the corresponding 
metabolite profile must respect physiological ranges and energetic constraints implied by flux directions. 
The enzyme demand in a reaction, at a given desired flux, depends on thermodynamic and kinetic factors. 
To see what each factor contributes, we split the formula for enzyme demands into a product of terms, each 
with a simple interpretation. The reaction rate depends on enzyme level, forward catalytic constant k+ at 
(i.e., the maximal possible forward rate per mM of enzyme), driving force (which determines the relative 
backward flux), and kinetic effects (such as substrate saturation or allosteric regulation) that modify the 
forward flux (see Figure |T|). 
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Figure 1: The catalytic rate of enzymes is decreased by different physical factors. Under ideal conditions, an 
enzyme molecule catalyses its reaction at a maximal rate, given by the enzyme's forward catalytic constant 
(top). The rate is reduced by microscopic reverse fluxes (center) and insufficient availability of substrate 
(incomplete saturation, leading to waiting times between conversion events). As the catalytic rate of the 
enzyme decreases (left), realizing a desired metabolic flux requires increasingly more enzyme (right). 

2 Enzymatic rate laws 


Reactions of the form A ±=r B can be described by the reversible Michaelis-Menten kinetics. A generalized 
form for reactions with multiple substrates (concentrations s ? ;) and products (concentrations^) reads 
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The molecularities to® and mf represent the (positive) stoichiometric coefficients, but they may be scaled 
by a reaction-specific factor which effectively acts like a Hill coefficient. Using a stoichiometric coefficient 
uu and a molecularity m® = 2\nu\ is equivalent to using a Hill coefficient of 2 in the rate law. For reasons 
of thermodynamic consistency, reaction rates must vanish in chemical equilibrium states; to ensure this, 
equilibrium constants and rate constants must satisfy the Haldane relationship |1J 


fcc + at n,(A; M r f 


where .sy and p :l denote to substrate and product levels, respectively. Since the equilibrium constants depend 
on the Gibbs energies of formation as K eq = e~ AlG l RT t they must satisfy Wegscheider conditions [2]: 
the vector of equilibrium constants satisfies lnAT eq = N totT /i, o/ , with the stoichiometric matrix N tot 
for all metabolites and the vector p°' of transformed Gibbs free energies of formation. Accordingly, the 
equilibrium constants must satisfy a Wegscheider condition In A' eq • k = 0 for any thermodynamic cycle k, 
i.e., any nullspace vector of N totT . The denominator D in Eq. (JTJ) depends on the enzyme mechanism. In 
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general, it is a polynomial 


D{c) = l + Y J Mi k \{cT lik (3) 

k i 

of the metabolite concentrations with positive coefficients Mik and exponents niuk■ For examples of such 
denominators, see appendix[A] In the underlying enzyme mechanism, each sum term (index k) represents 
a binding state of the enzyme. The exponents muk indicate the numbers of reactant molecules bound 
in one binding state and the prefactors encode the binding energies. The sum term 1 represents the 
unbound enzyme. The highest-order substrate term reads JI, { s i/Kf A ) m ^ and the highest-order product 
term reads Y\i{pi/K^) mi . In addition, the denominator may contain additive or multiplicative terms 
for allosteric activation and inhibition. The exponents muk are usually positive integer numbers. With 
allosteric regulation, however, there can also be denominator terms of the form Ks/s. 


3 Separable rate laws and enzyme cost 


Following [3], we consider general reversible rate laws and factorize them into 

v = £-fc cat V h 'V dn -?7 res , 


(4) 


where fc cat = k// at is the forward catalytic constant. For an example of such a factorization, see appendix 
m The energetic efficiency 
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depends on the mass-action ratio T (e.g., T = p/s for unimolecular reactions) and on the equilibrium 
constant K eq , or briefly on the driving force 0 = —A r G/RT. Note that our driving forces are defined 
via molecularities, not via the stoichiometric coefficients; to allow for a consistent equilibrium state, all 
reactants within a reaction must show the same Hill coefficient [4]. The relationship T/A' eq = e _e links 
concentrations to driving forces and holds for ideal chemical mixtures with constant activity coefficients. 
The kinetic efficiency r/ klIi depends on the rate law and can be derived from the rate law’s denominator. 
For a general reversible rate law, the kinetic efficiency would read 

^in = i usi/xyr* 

D(s 1 ,s 2 ,..,Pi,P2,~) 


where the substrate-dependent numerator i (si/Kf /l ) m * stems from the positive numerator term in the 
rate law and the denominator D is given by the rate law denominator. A factorized formula ([4]), called 
separable rate law pj], exists for reactions of arbitrary stoichiometry (for examples, see SI |A]). The fac¬ 
torization is always possible even for general rate laws, because rate law numerators must have the form 
^cat n, — k~ at n,:(Tz/AT M zi) m “ for reasons of thermodynamic consistency. 

The terms in the rate law ([4]) depend on metabolite levels in different ways. The first terms, represent 
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(a) Reversible Michaelis-Menten kinetics (factorized, with driving force 9 = —A r G/RT) 
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(b) Enzyme cost function (factorized form) 
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Figure 2: Separable rate law and enzyme cost function, (a) Reversible rate laws can be factorized [3]. 
The example shows a reaction S ^ P with reversible Michaelis-Menten kinetics Eq. (??) and a factor for 
non-competitive allosteric inhibition (inhibitor concentration x). (b) The enzyme cost y (enzyme level e, 
multiplied by the specific enzyme cost h) contains the terms from the rate law in inverse form. By omitting 
some terms (or replacing them by constant numbers), one obtains simplified enzyme cost functions. 


the maximal velocity (the rate at full substrate-saturation, no backward flux, full allosteric activation), while 
the following efficiency terms describe how this velocity is reduced in reality: the factor 77 th describes a 
reduction due to backward fluxes, and the factors ? 7 kln and 77 th describe a further reduction due to incomplete 
substrate saturation and allosteric regulation. While is an enzyme-specific constant (yet, dependent 
on conditions such as pH, ionic strength, or molecular crowding in cells; unit 1/s), the efficiency terms 
are concentration-dependent, unitless, and can vary between 0 and 1. The thermodynamic efficiency 77 th 
depends on the driving force (and thus, indirectly, on metabolite levels) and the equilibrium constant is 
required for its calculation. The kinetic efficiency ? 7 kln depends directly on metabolite levels and contains 
the K m values as parameters. Allosteric regulation can be captured by ? 7 kln (as additive or multiplicative 
terms in the denominator), but non-competitive allosteric regulation can also be described by a separate 
term rj leg . If rate law, flux, and metabolite levels are known, a reaction’s enzyme demand follows from 
Eq. (@1 as 
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By weighting the enzyme demand with an enzyme-specific cost hi, we obtain the cost function 
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Dividing Eq. ([ 8 ]) by vi, we obtain the specific flux cost T; = yi/vi = hi/ri. Eq. (J 8 |) shows which factors 
shape enzyme cost, and how. The first two terms yield the minimal cost Yj 1 ' = (the cost per 

flux under ideal conditions); the following terms further increase this value. To keep enzyme cost low, 
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Figure 3: Enzyme demand in a metabolic pathway, (a) Pathway with reversible Michaelis-Menten kinetics 
(equilibrium constants and K M values are set to 1). The external metabolite levels x and y are fixed, while 
internal levels a and b can vary. Plots (b)-(d) show the enzyme demand for reactions 1, 2, and 3 (enzyme 
levels needed to sustain the desired unit flux). Regions of infeasible metabolite profiles are shown in grey. At 
the edges of the admissible region (where chemical equilibrium would obtain), the thermodynamic driving 
force goes to zero; this must be compensated by a high enzyme level. The enzyme demand in reaction 1 
(shown in (b)), for instance, increases with the level of A (x-axis) and goes to infinity as the mass-action 
ratio a/x reaches the equilibrium constant (where the driving force vanishes), (e) Total enzyme demand 
(sum of enzyme levels). The metabolite polytope - the intersection of feasible regions for all reactions - is 
a triangle, and the enzyme demand is a cup-shaped function on this triangle. The minimum point marks 
the optimal metabolite levels, from which optimal enzyme levels can be computed. 


the k cat values, driving forces (i.e. imbalance in substrate and product levels), and substrate saturation 
(i.e. high substrate concentrations) should be high. For a pathway with desired fluxes vi and known 
log-concentrations Xi = lnCj, the total cost reads 

y pw (x) = ^2yi(v,c) = ^2 hi £i{v h c). (9) 

i i 

Setting hi = 1, we obtain the total enzyme demand (as a concentration in mM), and setting hi = mi 
(protein mass in Daltons), we obtain the mass concentration (in gram protein per cell dry weight) as a 
special case. 


4 The metabolite polytope represents the states of a kinetic model 

A metabolic network (with given flux directions, equilibrium constants, and metabolite bounds) defines a 
convex metabolite polytope V in the space of log-concentrations Xi = lncj (where c t is measured in units 
of the standard concentration c„ = 1 mM). An example is shown in Figure[3] In general, the polytope arises 
from two sorts of inequality constraints: (i) Upper and lower bounds < x t < x™ ax for metabolite 
levels yield a box-shaped metabolite polytope; some metabolite levels may also be constrained to a fixed 
value, (ii) Reaction fluxes dissipate Gibbs energy (0; - vi > 0), so the driving forces must be positive in the 
direction of the flux. The resulting constraints 0 < 0; = l nc i further restrict the metabolite 


5 


























polytope; they define E-faces of the polytope (representing an equilibrium condition), where enzyme costs 
rise steeply. The metabolite polytope is a convex polytope in log-concentration space, which contains all 
feasible metabolite profiles. It is bounded by two types of faces: faces that represent an equilibrium in one 
of the reactions ("E-face”), where enzyme cost goes to infinity; and faces stemming from physiological 
metabolite bounds ("P-face"). Minimum points of the enzyme cost function can be inside the polytope or 
on P-faces. The polytope's precise shape depends on the Enzyme Cost Function (ECF) score chosen (i.e, 
on the simplifications applied) and on rate laws, rate constants, and specific enzyme costs hi in the model. 

The metabolite polytope plays a central role in enzyme cost minimization: For a given model and flux 
profile v, the points of the polytope parametrize the set of all possible steady states (e, c, v). Feasible 
metabolite profiles (represented by polytope points) can be uniquely mapped to enzyme profiles, while the 
mapping from enzyme to metabolite profiles need not be unique. The entirety of metabolic states (or 
all steady states) of a kinetic model can be parametrized as follows: we consider the (non-convex) flux 
polytope and construct, for each point, the metabolite polytope. The construction yields all steady states 
(characterized by concentrations, fluxes, enzyme levels) exactly once (while the same enzyme profile may 
appear several times). Using this fact, we can parametrize all metabolites states of a kinetic model in a 
simple and systematic way (for details, see appendix[Q). However, a restriction to stable steady states is 
not easily possible. 


5 Enzyme cost is a convex function on the metabolite polytope 

The enzyme cost functions © and © are differentiable convex functions on the metabolite polytope (proof 
in Sl[D]). Convexity means that an interpolated metabolite vector, on a line between two log-concentration 
vectors x a and Xb, has a cost that is higher than (or at most equal to) the interpolated cost: 

VA G [0,1] : j/ pw (Ax a + (1 - A) x b ) < Ay pw (x a ) + (1 - A) y pw (x b ). (10) 

To show that the ECF scores are convex, we consider the most general rate laws with denominator (|3|) and 
rewrite it in the form 


v 


e • k ta.t • v th ■ n 


^kin 


implying the enzyme cost function 


V = ^2 Vi 
l 


E hivi 1 1 

Ucat th ^kin 

/ Vl Vl 


(ii) 


( 12 ) 


6 



for a pathway. The efficiency terms are given by 


77 th = 1 — e e = 1 - exp A r G°' + n t In a 

i tm = n(y)"" (x>n<?“) - (E«*n-rj («> 

with coefficients a.k € K.+ and a,ik € R. The regulation efficiency rf eg can be neglected because it can 
always be covered by the term ? 7 kln . The cost function (fl2ll with efficiency terms (fl3ll is convex on the 
metabolite polytope. The function stays convex if the investment function H{e) is not linear, but convex. 
Importantly, even though all ECF scores are convex, they may not be strictly convex (in which case there 
would be a < sign, instead of <, in Eq. (jlOp T For instance, simplified ECF scores can be constant in the 
metabolite polytope. Non-strict convexity can arise when the mapping from enzyme to metabolite profiles 
is not unique. Flowever, it is possible to enforce a unique optimum by adding a convex regularization term 
j/ reg , e.g., a quadratic function favoring metabolite levels in the center of the typical concentration range. 
Such terms can be justified by biological side objectives: for instance, keeping metabolite levels away from 
their upper or lower bounds in advance will later allow cells to vary them more flexibly. We can also 
consider a variant of ECM with an additional metabolite-dependent objective z. instead of minimizing the 
enzyme cost alone, we then minimize the difference y p ® (x) = j/ pw (x) — z met (x). For instance, an objective 
z met (lnc) = Yhili (l nc i — lndi ) 2 would represent a preference for metabolite levels close to "ideal" levels 
Ci, with cost weights 7 If z met (x) is strictly concave, —z can be used as a regularization term y reg . The 
resulting effective cost y®® will be strictly convex even if the term z is very small. 

The convexity proof suggests that enzyme levels can be predicted with relatively little effort. Enzyme cost 
minimization uses a metabolic network, a given flux profile v, and possibly kinetic rate laws (with their 
thermodynamic or kinetic constants), and specific enzyme costs. The flux profile may be stationary (like 
flux profiles determined by FBA) or non-stationary (like experimentally measured fluxes, simply inserted 
into a model). In any case, it must be free of thermodynamically unfeasible cycles, and must agree with the 
assumed equilibrium constants and external metabolite levels. If the given flux directions are infeasible, the 
metabolite polytope will be an empty set. To find an optimal state, we choose an ECF score and minimize 
the total enzyme cost within the polytope. Optimal metabolite profiles, enzyme profiles, and enzyme costs 
are obtained by solving the Enzyme Cost Minimization (ECM) problem 

y opt (v) = min x6 -p y pw (x) 

x opt (v) = argmin xgT ,y pw (x) 

e opt (v) = e(v, c opt (v)) (14) 

for log-concentration vectors x = Inc. The total cost y pw (x) (see Eq. (J9]) ) is a sum of enzyme costs ([ 8 ]) 
or simplified ECF scores. If there is no unique optimum for x (because the cost function is constant along 
some subspace, and therefore not structly convex), a unique solution can be enforced by adding a convex 
regularization term 7 / reg (x) to y pw . Since the optimal enzyme levels depend on external metabolite levels, 
they must be recalculated after changes in external conditions. If non-enzymatic reactions (typically with 
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Figure 4: Two ways of framing enzyme allocation as an optimality problem, (a) Benefit-cost optimization. 
Each enzyme profile determines a metabolic state (with state variable vectors v and c and a metabolic 
objective z(v,c)) and an investment h(ei, £ 2 , ..). To predict an optimal enzyme profile, we maximize the 
difference z — h. (b) In ECM, a predefined flux profile is realized by an enzyme profile (and a corresponding 
metabolite profile) with a minimal investment; a concentration objective can be considered in addition. 
Using the metabolite concentrations as free variables makes the problem relatively easy to solve. 


mass-action rate laws) are included in the optimality problem, they contribute to the energetic constraints, 
but not to the enzyme cost function. 


6 Discussion 

In summary, we saw that logarithmic metabolite levels are suitable variables for screening, sampling, and 
optimizion of metabolic states. Due to the mapping from metabolite profiles to enzyme profiles, all feasible 
enzyme profiles can be reached, where bounds on driving forces can be formulated as linear constraints. 
Moreover, using the metabolite log-concentrations as free variables does not only provide a good search 
space, but also facilitates the optimization problem: under general and reasonable assumptions, the ECF 
scores are convex functions on the metabolite polytope. Convexity holds for a wide range of rate laws, 
including rate laws with allosteric regulation. As a consequence, the optimization remains tractable for 
various rate laws and larger metabolic networks. Strict convexity (required for an isolated optimum point) 
can be enforced by adding small regularization terms y les , possibly representing biological side objectives. 

Optimal enzyme allocation in kinetic models can be framed in two main ways. One the one hand, enzyme 
levels can be treated as control variables which determine the metabolic state, and thus the fluxes (see, 
e.g-. El IS])- The aim is to find the enzyme profile that leads to an optimal state (where enzyme cost can 
come into play as a constraint or as a penalty functions). On the other hand (as in [7] 0] and here), one 
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can predefine the fluxes and search for enzyme levels that realize them in an optimal way (here, minimal 
cost is used as the optimality criterion). Both approaches address similar problems and lead to equivalent 
solutions. For instance, if we first maximize a flux at a fixed total enzyme level, as in [§], and then use 
this flux as a constraint in ECM (with identical specific costs for all enzymes), we recover the metabolite 
and enzyme profile from the initial calculation. In fact, both optimality problems can be derived from a 
common general optimality problem by constraining the fluxes or the total enzyme cost. The approaches 
frame the same problem, but in different ways. Using fluxes as a scaffold for model construction has 
several advantages. First, it makes the optimality problem convex. Second, we can precisely specify the 
flux state to be modelled. Third, the flux cost functions y opt (v) can be used in flux analysis. Thus, ECM 
enables more realistic variants of FBA; the cost function may contain additional side objectives scoring the 
metabolite levels. If flux profiles are compared at a given flux benefit, it is only the cost scores that count 
in the optimization, so cost and benefit can be measured in different units. In benefit-cost approaches 
as in Figure [4] (a), enzyme investment and metabolic objective are directly compared and show the same 
physical units. In order to make them comparable, a relative weighting would have to be established, which 
bears the problem of arbitrariness. 

Our fundamental assumption - that enzyme levels are cost-optimized in every moment - is of course 
debatable. Instead, proteins may be expressed to higher amounts to anticipate sudden challenges (exam¬ 
ple: energy production in muscle cells). Preemptive protein expression can avoid costs for rearranging 
the proteome and performance losses during adaptation; however, in ECM it would appear futile. Also 
flexibility in metabolite concentrations can be important, and cells might trade it against enzyme economy. 
Furthermore, enzyme and metabolite levels in cells are not only shaped by demands in a single pathway, 
but also by other pathways outside the model in question. Finally, if proteins are used as an amino acid 
storage, there will be little pressure to keep them at low concentrations. How can our method be useful 
despite all this? First, an account of simple enzyme economy can be a basis for studying more complicated 
optimality requirements afterwards. Second, despite all these points of critique, enzyme economy may be 
the main requirement, e.g., during fast, nitrogen-limited growth. Third, we can study how deviations from 
the optimal state affect enzyme cost, and thus fitness. Finally, ECM can be extended to include more 
objectives and constraints into our pathway model, and thus to account for the surrounding cell. The 
metabolites from our pathway may also be involved in other pathways outside the model. If these other 
pathways demand higher or lower metabolite levels, we can implement this fact in ECM by constraints 
(upper and lower concentration bounds) or by concentration-dependent side objectives z met (c), which pe¬ 
nalize unfavorable metabolite levels. Trade-offs between the pathway in focus and other pathways around 
it can be handled in this way. Alternatively, we can assume that each metabolite level should be close 
to the centre of its allowed range (which also provides flexibility, because it will not easily hit a bound). 
Again, this can be captured by side objectives. 

As a possible application, enzyme cost functions allow us to define non-linear flux costs for use in flux 
balance analysis. ECM is based on a given flux profile. However, since it can be applied to any flux 
profile, it defines a flux cost function y opt (v), which can be applied in flux prediction. FBA with minimal 
fluxes (fmFBA) compares flux profiles at equal benefit (FBA objective) and minimizes their heuristic cost. 
The flux cost functions used are linear (for a predefined choice of flux directions). Linearity simplifies 
calculations, but is not very realistic: first, cost scores like the sum of fluxes do not account for kinetics 


9 


and regulation; second, the costs add linearly when flux distributions are linearly combined. Flux cost 
functions obtained from ECM, and based on a kinetic model, are more realistic. In an mfFBA based on 
such cost functions, one would predefine flux directions, flux bounds, and a flux benefit z v • v = b, and 
assume stationary fluxes; but instead of a linear flux cost, one would minimize the flux cost r/ opt (v). Flux 
costs derived from ECM are concave functions on the flux polytope (with given flux directions). This implies 
that the solutions of the new fmFBA problems will be elementary flux modes, which confirms findings from 
other enzyme optimality approaches [9] [TO]. In fact, the flux cost function y opt (v) can be expected to be 
strictly concave (except for specific cases, e.g. models containing two identical reactions with identical rate 
laws). If this is this case, elementary flux modes are the only solutions. As a consequence, splitting a flux 
profile into elementary modes that run in different compartments or at different time points can be better, 
but never worse than the original flux profile in terms of enzyme cost. 
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Symbol 

Unit 
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Vl 

mM/s 

Metabolite level 

Ci 

mM 
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Xi = \n.(ci/c a ) 

mM 
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£i 

mM 

Reaction rate 

vi{e u c) = £i • r;(c) 

mM/s 

Specific rate 

n = vi/ei 

V s 

Scaled reactant elasticity 

K 

1 

Gibbs free energy of formation 

G'° 

kJ/mol 

Reaction Gibbs energy 

A r G; = A r G'° + nuRT Inc* 

kJ/mol 

Driving force 

0; = -\G'JRT 

1 

Forward/backward catalytic constant 

i,+ u~ 

A/ cat) '''cat 

1/s 

Michaelis-Menten constant 


mM 

Specific enzyme cost 

hi 

D/mM 

Enzyme cost 

yi = hi si 

D 

Total enzyme cost 

y = J2i h i si 

D 

Specific flux cost 

T, 

D/(mM/s) 

Enzyme-optimal cost 

y° pt (v) = min xi „pJ/ pw (x) 

D 


Table 1: Terms and symbols used in enzyme cost minimization. Darwin (D) is a hypothetical fitness unit. 
Reaction directions are defined in such a way that fluxes are positive. To define log-concentrations, we use 
the standard concentration c a = ImM (shown here, but omitted elsewhere for simplicity.) 
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A Kinetic rate laws 

By considering simple enzyme mechanisms with few binding states, we obtain general rate laws applicable 
to all reaction stoichiometries. The rate law denominators to be used in Eq. (|T]) have simple structures 
(containing only few of the possible sum terms, and with prefactors following from a few Michaelis-Menten 
constants) [4|. If denominator terms are omitted, the rate will be overestimated, i.e., enzyme demand and 
costs will be underestimated. First, there are rate laws with denominators 

D (s) = J^[(s i /iGf) m ? 

i 

£ (SP) = + (15) 

* 3 
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which lead to the energetics-based ECF2 scores. The big product terms are called principal substrate and 
product terms. As before, .s t and pj denote substrate and product levels. The first formula assumes that 
substrate levels are high and product levels are low; the second one assumes that substrate and product 
levels are both high. Next, there are rate laws with denominators 


c (is) = 

i 

d (isp) = i+H( Si /Kfr<+~[[(p j /K™r* (i6) 

i 3 


which lead to the saturation-based ECF3 scores. The denominators contain only three possible terms: the 
term 1, the principal substrate term, and the principal product term. To justify these rate laws, we assume 
a strongly cooperative binding between substrates and between products. The first formula assumes low 
product concentrations; the second formula describes the direct-binding modular rate law [4], The direct- 
binding modular rate law is a generalized version of reversible MM kinetics. In the underlying enzyme 
mechanism, the enzyme exists in three states: fully bound with substrates, fully bound with products, or 
empty. If enzymes are a I losterica lly regulated, the rate law denominators contain additive or multiplicative 
terms for regulation [4]. Additive terms can arise from competitive regulation. Multiplicative terms (for 
non-competitive regulation) can be split from the denominator and treated as prefactors in the rate law. 

k i 

Typical choices are A for non-competitive activation and — for non-competitive inhibition, with 
rate constants k A and k 1 and regulator concentration x 0]. Accordingly, allosteric effects can either be 
listed by a separate efficiency term in the factorized ECF formulae, or be included in the kinetic efficiency. 
For instance, the kinetic efficiency term for MM-kinetics with non-competitive inhibition can be split into 


?7 kin = 


s/K s 


(! + *)(! + * + *) 


1 + + kT (1 + X /Ki) 


— ^kin* rfeg 


(17) 


B Factorization of rate laws 


To demonstrate how rate law are factorized, we consider the common modular rate (CM) law [11, 4], a 
generalized form of reversible MM kinetics with the denominator 


d (cm) = Y[(l + Si /K™) m < +Y[(l+ Pj /Kf) m *-1. 


(18) 


In the assumed enzyme mechanism, substrate molecules bind independently, product molecules bind inde¬ 
pendently, and substrate and product binding exclude each other. For a bimolecular reaction A + B ^ 
P + Q, the rate law 


v 


£ 


,+ IT [51 l. \P]\Q] 

cat Ka Kb ca ^ Kp Kq 



(19) 
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can be rewritten as 


Ldiig] 


f-PlfQl 


= ek~; 


K a K b K P K q 


(1 + K + S + I ^ 1 + M 


Kab 


= ek, 


Kp 
1 — e -0 


M 

Kq 


f-PlfQh 

Kpq ) 


' cat Ka Kb ^ 




141 4. 151 1 
K a ^ K b ^ K A b 


_l in j_ M _l LOM\ 

T Kp ' Kn ' Kpn ) 


Kq ^ K P q 


= ek+ t [ l-e-%^ 


(20) 


where we defined the kinetic efficiency 


V 


kin 


Ka Kb /1 
[A][B] 


M 

Ka 


M 

K b 
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I 41 M , in 

I<ab ^ K P 


M 

Kq 


IfUSh 

Kpq ' 


( 21 ) 


and used the Haldane relationship K eq = k® and the identity e 0 = /-^eq- l n the calculation, 

we first separated the k+ at value from the rest of the fraction, and then hid the negative flux term in the 
energetic efficiency term 77 th . 


C Parametrizing all states of a kinetic model 

In a kinetic model with given rate laws and external metabolite concentrations, an enzyme profile (ei, ..e 2 ,..) 
lead to a steady state with metabolite levels c and fluxes v. The following proposition shows how the set 
S of such steady states a = (e,cv) can be easily parametrized. 

Proposition: Consider a kinetic model with rate laws vi = £iri{ c), thermodynamically consistent rate 
constants (satisfying Wegscheider conditions and Haldane relationships), a feasible positive flux profile v, 
and bounds on metabolite levels. Any feasible metabolite profile Inc £ P can be realized by some positive 
enzyme profile e; given the metabolite levels, the enzyme levels are uniquely determined and given by 
£j(lnc) = vi/ri( c), which is a differentiable function on the metabolite polytope. 

Proof: If a metabolite profile c is feasible for the given flux profile v, the specific rates n(c) obtained from 
reveresible rate laws (see Eq. m in appendix) have the same signs as vi, so £1 = Vi/ri( c) is positive on 
the entire metabolite polytope. Since r/(c) is differentiable and does not change its sign on the metabolite 
polytope, £1 (ln c) is differentiable on the metabolite polytope. 

According to our proposition, any thermodynamically feasible metabolite profile can be realized by some 
steady state of the kinetic model (with an appropriate choice of enzyme levels), so the set S of metabolic 
states with a given flux profile v can be characterized by points of the metabolite polytope. In particular, 
the set of kinetically realizable metabolite profiles depends on the equilibrium constants, but not on enzyme- 
specific rate constants. 

With simplified rate laws, the same enzyme profile may be realizable by different metabolite profiles, (i) If 
a metabolite appears in a model but has no impact on any reaction, its concentration can be freely varied, 
independently of the enzyme levels, (ii) With simplified cost scores in which all efficiencies 77 th , ? 7 km , and 
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? 7 reg are taken to be constant, enzyme levels do not depend on metabolite levels, (iii) With simplified 
scores in which ? 7 kln and rf eg are taken to be constant, enzyme costs depend on metabolite levels only 
through the thermodynamic forces. Notably, the vector x = Inc can be varied along directions in the 
nullspace of N totT without affecting the driving forces or enzyme cost. Thus, the enzyme cost scores have 
an invariant subspace on the metabolite polytope (namely the nullspace of N totT ). Under what conditions 
more complicated enzyme cost scores (without regularisation terms) have unique optima remains an open 
question. 

Finally, to parametrize all steady states of a kinetic model, we can follow a two-step procedure in which 
we enumerate all possible flux distributions and, for each of them, all possible enzyme and metabo¬ 
lite profiles. The thermodynamically feasible flux distributions v form a set V, given by V = {v|3x : 
sign(v) = sign(—A r G'(x)}, where x stands for log-concentration profiles. The reaction Gibbs ener¬ 
gies A r G[ = A r G'° + RTJ2i n u Incy depend on the internal and external metabolite levels and on 
the equilibrium constants chosen. According to thermodynamic condition, whether a flux distributions 
is feasible or infeasible depends solely on its sign pattern. By imposing upper and lower bounds and 
the stationarity condition, we can further limit this set and obtain the set of feasible, stationary fluxes 
ystat = {v|v min < v < v max A N v = 0 A 3x : sign(v) = sign(—A r G(x)}. V stat is a (generally non- 
convex) polytope in flux spaces. Each flux distribution v in this set defines a set of possible states S v , one 
can then set S = {(e, c, v)|v £ V A (e, c) € <S V }. 


D Convexity proof for enzyme cost functions 

To prove the convexity of general enzyme cost functions, on the metabolite polytope and at given desired 
fluxes, we first show the convexity of some simple functions appearing in the formula. 


D.l General lemmata 

Lemma 1 The function f(y ) = — ln(l — e v ) is convex in the range y < 0. 
Proof D.l The second derivative 

d 0 ^ 

dy = (1-ev) 2 


is positive for y < 0. 

Lemma 2 The function /(x) = In Efc=i eXk ,s convex. 

Proof D.2 


V 2 /(x) 


Dg'(c)(l T c) c c T 
(l T c) 2 


(where d = e Xi ) 


Vu : u T V 2 /(x)u 


(Ei g ) 2 


> o 
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since u i c i) 2 < (XX c i u i XXX c i) horn the Cauchy-Schwarz inequality. Therefore, the Hessian V 2 /(x) 
is positive semi-definite, which proves that /(x) is convex. 

Lemma 3 For any number v G R+ and vector n G R m , the function — ln(l — ive n x ) is convex over 
{x G R m | ve n x < 1}. 

Proof D.3 This function is a composition off = — ln(l— e v ) from Lemma{l\with the affine transformation 
y = n ■ x + In v, an operation which preserves convexity. 

Lemma 4 For any matrix A G R” xrra and vectors b G R" , the following function is convex overx G M m .' 

ln {j2 eakX+bk ^j ( 22 ) 

where a.; is the ith row of A. 

Proof D.4 This function is a composition off = In ]P " =1 f mm Lemma\^with the affine transformation 
Xi = a.i ■ x + bt, an operation which preserves convexity. 

D.2 The convexity of enzyme cost functions 

Lemma 5 Assume that all enzyme-catalysed reactions in a model behave according to rate laws of the 
type 


v 


,th ^kin 


e-*cV»r 


(23) 


with 77 th and // kl " given by Eq. m with coefficients au G R+ and ciik G R. Assume that the enzyme 
cost function for enzymatic reaction l reads 


Vi 


hi vi 
£1 


hi vi 1 1 


/.cat 


rjth ^kin 

r h r n 


(24) 


Then the total enzyme cost y = Y2i Vi- as 3 function of logarithmic metabolite concentrations (x = In c ), 
is convex. 


Proof D.5 To simplify the efficiency terms, we can use the abbreviations Xi = In a, v = exp(A r G° , /-R , 7 1 ), 
and bk = In : 
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th 
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kin 


1 — e 



(25) 
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If we look at the natural logarithm ofyi, 


In Vl = In - In rtf' - In rf*, (26) 

we see that each of the three terms in the sum is convex in x. The first term is constant with respect to the 
metabolite concentrations and therefore trivially convex. The energetic term, — In 77 th = — ln(l — ue~ n x ), 
is convex according to Lemma\3\ The kinetic term, — Inr/ kln = In (X]fc=i e afc ' x+bfc ), is convex according 
to Lemma\4\ We conclude that yi is convex too, since it is a composition of a convex function (e x ) with 
another convex function (In yi). Finally, the total enzyme cost (y) is convex since it is a sum of convex 
functions: 


y = 

l 


(27) 
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